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2x2 crossover design 
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Abstract: When conducting a paired 2x2 crossover design, each subject 
is paired with another subject with similar characteristics. The pair is then 
randomized to the same sequence of two treatments. That is, the two sub- 
jects receive the first experimental treatment, and then they cross over and 
receive the other experimental treatment (s). The paired 2x2 crossover design 
that was used in the Beta Adrenergic Response by GEnotype (BARGE) Study 
conducted by the National Heart, Lung and Blood Institute's Asthma Clini- 
cal Research Network (ACRN) has been described elsewhere. When the data 
arising from such a design are balanced and complete - or if at least any miss- 
ingness that occurs is at random - general linear mixed-effects model methods 
can be used to analyze the data. In this paper, we present a method based 
on a pattern-mixture model for analyzing the data arising from a paired 2x2 
crossover design when some of the data are missing in a non-ignorable fashion. 
Because of its inherent scientific interest, we focus our particular attention on 
the estimation of the treatment-by-type of subject interaction term. Finally, 
we illustrate the pattern-mixture model methods described in this paper on 
the data arising from the BARGE study. 



1. Introduction 

Two important design principles are occasionally used in clinical trials: 1) A subject 
is "matched" or "paired" with another subject with similar characteristics to reduce 
the chance that other variables obscure the primary comparison of interest. 2) A 
subject serves as his or her own control by "crossing over" from one treatment to 
another during the course of an experiment. Experiments employing the first design 
principle are called matched pair designs (Cochran [1]), while those employing the 
second are called crossover designs (Jones and Kenward [2], Ratkowsky ct al. [-i] 
and Senn [4]). 

There arc situations in which it may be beneficial to use the two design princi- 
ples simultaneously. That is, it may be advantageous to conduct a "paired crossover 
design." For such a design, each subject and his/her paired counterpart arc ran- 
domized to the same treatment sequence. That is, they receive one experimental 
treatment, and then cross over and receive the other experimental treatment(s) at 
the same time. A paired crossover design was recently used in three clinical tri- 
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als conducted by the National Heart, Lung and Blood Institute's Asthma Clinical 
Research Network (ACRN) (Kephart [5]; www.acrn.org). 

In particular, the Beta Adrenergic Response by GEnotype (BARGE) Study (Is- 
rael et al. [()]) used a paired 2x2 crossover design to compare the regular use of 
inhaled albuterol (A) to placebo (P) in patients with the Arg/Arg genotype and 
the Gly/Gly genotype at the 16*'' position of the beta-agonist receptor gene. It was 
hypothesized that the scheduled daily use of inhaled albuterol, the most common 
treatment for patients with mild to moderate asthma, actually had a detrimental 
effect on the lung function of patients with the Arg/Arg genotype (R) but not 
patients with the Gly/Gly genotype (G). The primary research question therefore 
concerned whether or not the treatment effects differed for the two genotypes. That 
is, the primary hypothesis concerned inference about whether the interaction pa- 
rameter: 

7 if^BA - fJ^Bp) - {l-i-GA - ^J.Gp) 

is 0, where ^ki is the population mean lung function of genotype k patients on 
treatment I. 

To achieve an efficient comparison of the treatments within each genotype, a 2 x 2 
crossover design was used within each genotype k. Because a subject's genotype 
is a pre-determined characteristic, subjects could not be randomly allocated to 
genotypic group. Therefore, to minimize confounding caused by potential differences 
in the baseline lung function of the two genotypic groups, each subject of genotype 
R was matched to a subject of genotype G with similar baseline lung function. The 
matched subjects were randomly assigned to the same sequence of the crossover 
design, and an eight-week washout period was placed between the two treatment 
periods (Table 1). 

At the conclusion of the BARGE Study, each pair of subjects j of sequence s 
ideally yielded the quadrivariate response Ysj = iYsjRA,YsjRp,YsjGATYsjGp)i a 
vector containing the subjects' changes in lung function. Unfortunately, as should 
be expected when conducting any clinical trial, measurements in the BARGE Study 
were not always collected as planned. That is, some subjects missed one or more 
planned visits or completely dropped out of the study. In this paper, we present 
a method based on a pattern-mixture model for analyzing the data arising from a 
paired 2x2 crossover design when some of the data are missing. 

In general, if = {yn, . . . , yip)' denotes the p x 1 vector of intended repeated 
measurements on subject i and m,; ~ {mn, . . . ,mipy represents a p x 1 vector of 
random indicator variables denoting whether or not response yij is observed, then 
pattern-mixture models factor the joint distribution of and nii as: 

TT [yj, niilXj] = TT [yi|Xi, nii] x tt [m^lXi] , 

where denotes fixed covariates or design matrices. That is, in short, the data are 
stratified by their patterns of missingness, and then a separate model is specified 

Table 1 

Paired crossover design for the BARGE Study. Pairs of subjects were randomized 
to either the AP sequence (first row) or the PA sequence (second row). All 
subjects had a washout period between the two treatment periods 





Subject 1 (R) 


Washout 


Subject 2 (G) 


Period 


1 2 




1 2 


Sequence 1 


RA RP 




GA GP 


Sequence 2 


RP RA 




GP GA 
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for each missing data pattern. The distribution 7r[yi|Xi,mi] models the within- 
subject regressions for each missing data pattern, and 7r[mi|Xi] models the marginal 
proportions of each missing data pattern as functions of between-subject covariates. 

It should be noted that there does exist a more general random-coefficient 
pattern-mixture model, e.g., see Little [7]. We do not pursue the more general 
model here, however, because our crossover model for the BARGE Study assumes 
that only one measurement is made in each period. 

The type of "missingness" that exists really should dictate our final analysis. 
Rubin [ ] and Little and Rubin [!)] describe three types of missingness for which 
we need not apply a pattern-mixture model: i) if the data are missing completely 
at random (the probability of response is independent of both the observed and 
unobserved data); ii) if covariate-dependent dropout exists (the missingness depends 
on fixed covariates in the model); and iii) if the data are missing at random (the 
probability of response depends on the observed data but not the missing data) . In 
any of these three cases, we simply can apply a general linear model with correlated 
errors to the data arising from a paired crossover design, as was performed by 
Simon and Chinchilli [10]. The resulting likelihood-based estimation and inference 
procedures are asymptotically unbiased. 

If the missing values satisfy none of the three types mentioned above, then Lit- 
tle and Rubin [: ] label the situation "non-ignorable" (the probability of response 
depends on the unobserved data), and the analysis of the available data require 
special methods. In this situation, when analyzing the data arising from a paired 
crossover design, we propose applying the pattern-mixture model methods that we 
now describe. 

2. Methods 

In defining a pattern- mixture model for a paired 2x2 crossover experiment, we 
assume that each subject is scheduled to contribute one response per period. The 
response may truly be a single post-treatment, outcome measure, such as forced 
expiratory volume in one second {"FEVi") in an asthma trial. Alternatively, and 
perhaps more commonly, the response could be a summary of multiple, repeated 
outcome measures, such as the change in response from pre- to post-treatment or the 
area under a dose-response curve ( ^^AU C" ) in a bioequivalence trial. We also assume 
that the subjects do not contribute any additional covariates. Thus, in general, if 
we refer to the two types of subjects who are matched as 1 and 2 and we label the 
treatments as A and B, then we expect each pair of subjects j of sequence s to 
yield only the quadrivariate response Ygj = {YsjiA,YsjiB,Ysj2A,Ysj2B)- To define 
a pattern-mixture model for a paired 2x2 crossover design that accommodates non- 
ignorable missing data, we must first consider the possible patterns of missingness. 

2. 1 . Patterns of missingness 

When modeling missing data in a clinical trial, it is common to assume that the 
missingness happens monotonically, i.e., after the first missing data point for a 
subject, all of the subsequent anticipated data points for that subject are also 
missing. For example, the data vector for a subject i might look something like 
Yi = (j/ii, yi2, 2/i3, ■)' assuming the subject misses all measurement occasions 
after the third occasion. The monotonic missingness assumption greatly simplifies 
the model development and subsequent analysis. 
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Unfortunately, for a paired crossover design, the monotonicity assumption is not 
realistic. The data vector ysj for the paired 2x2 crossover design contains data 
on the two subjects within a pair. Therefore, data vectors with non-monotonic 
missingness that look like ysj = {usjiA, ■,ysj2Ai are quite possible in a paired 
crossover design. One could still hope to simplify the problem by assuming that 
at least the missingness with respect to each subject is monotonic. However, the 
BARGE Study data set, to which we later apply the methods described herein, 
contains non-monotonic missingness within subjects. Therefore, we do not make 
any simplifying assumptions, but instead address the problem in its full generality. 

Another complication of handling missing data in a crossover design (paired 
or otherwise) is that, even when the same pattern of missingness occurs, different 
information is gleaned about the treatments from the subjects in different sequences. 
For example, for our paired 2x2 crossover design, data vectors for the first sequence, 
Yij, which are missing the last data point: 

yi] = [yijiA,yijiB,yij2A,-)' 

provide no information about the type 2-treatment B combination, while data vec- 
tors for the second sequence, y2j, which are missing the last data point: 

y2j = {y2jiB,y2jiA,y2j2B, •)' 

provide no information about the type 2-treatment A combination. Therefore, 
pattern-mixture models for any crossover design must accommodate patterns of 
missingness for each of the sequences. 

Let Pps denote the pattern of missingness for pattern p = 0, 1, . . . , 14 for sequence 
s = 1,2. Table 2 summarizes, for the paired 2x2 crossover design, the eight 
potential patterns of missingness that are monotonic within subject, while Table 3 
summarizes the remaining seven patterns of missingness that are not monotonic 
within subject. 

Table 2 

Eight patterns of missingness in paired 2-by-2 crossover design that are monotonic 
within subject, X denotes observed and ? denotes missing 



Subject 1 (R) Subject 2 (G) 



Pattern 


Sequence 


Per 1 


Per 2 


Per 1 


Per 2 


Poi 


1 (AB) 


X 


X 


X 


X 


P02 


2 (BA) 


X 


X 


X 


X 


Pll 


1 (AB) 


X 


X 


X 


? 


Pl2 


2 (BA) 


X 


X 


X 


7 


P2I 


1 (AB) 


X 


? 


X 


X 


P22 


2 (BA) 


X 


? 


X 


X 


P3I 


1 (AB) 


X 


? 


X 


7 


P32 


2 (BA) 


X 


? 


X 


7 


P4I 


1 (AB) 


X 


X 


? 


7 


P42 


2 (BA) 


X 


X 


7 


7 


P5I 


1 (AB) 


? 


? 


X 


X 




2 (BA) 


? 


? 


X 


X 


Pel 


1 (AB) 


X 


? 


7 


7 


P62 


2 (BA) 


X 


? 


7 


7 


Pn 


1 (AB) 


? 


? 


X 


7 


P72 


2 (BA) 


? 


7 


X 


7 
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Table 3 

Seven patterns of missingness in paired 2-by-2 crossover design that are non-monotonic 
within subject. X denotes observed and ? denotes missing 



Subject 1 (R) Subject 2 (G) 



Pattern 


Sequence 


Per 1 


Per 2 


Per 1 


Per 2 


^81 


1 (AB) 


7 


7 


? 


X 


-P82 


2 (BA) 


? 


7 


7 


X 


-Pgi 


1 (AB) 


? 


X 


7 


? 


P92 


2 (BA) 


? 


X 


7 


? 


Pw,i 


1 (AB) 


? 


X 


X 


X 


P10.2 


2 (BA) 


7 


X 


X 


X 


Pll,l 


1 (AB) 


X 


X 


7 


X 


Pll,2 


2 (BA) 


X 


X 


7 


X 


Pl2,l 


1 (AB) 


7 


X 


7 


X 


Pl2,2 


2 (BA) 


7 


X 


7 


X 




1 (AB) 


7 


X 


X 


7 


Pl3,2 


2 (BA) 


7 


X 


X 


7 


P14.I 


1 (AB) 


X 


7 


7 


X 


P14.2 


2 (BA) 


X 


7 


7 


X 



Complicating issues arise when modeling the data from a 2 x 2 crossover experi- 
ment even when the data are complete. Therefore, before defining a pattern-mixture 
model that accommodates non-ignorable missing data, we first define a statistical 
model assuming the data are complete. 

2.2. A paired 2x2 crossover model for complete data 

In defining a statistical model for the paired 2x2 crossover design, one of the 
complicating issues is the handling of "treatment carryover." In an AB sequence, 
a subject receives treatment A in the first period and treatment B in the second 
period. In this case, the "A carryover" (A^i) is the component of the response in the 
second period due to the lasting effect of treatment A from the first period (Jones 
and Kenward [2], Ratkowsky et al. [S] and Senn [4]). Likewise, in a BA sequence, 
the "B carryover" (As) is the component of the response in the second period due 
to the lasting effect of treatment B from the first period. In a 2 x 2 crossover, if 
the magnitude of the A carryover differs from that of the B carryover {Xa 7^ As), 
it is not possible to estimate the primary quantity of interest - the true treatment 
effect — Ms- 

There are two situations in which it is possible to estimate the true treatment 
effect: 1) if the carryover effects are the same for each treatment {Xa ~ A^); or 
2) if the time periods between the treatment periods, i.e., the washout periods, 
are designed to be lengthy enough to render both carryover effects negligible {Xa = 
Xb =0). Most 2x2 crossover experiments - the BARGE Study included - institute 
a substantial washout period between the treatment periods with the intent of 
eliminating any carryover effects. We therefore proceed in formulating a model 
assuming the second situation holds. In general, if unequal carryover effects are 
suspected to exist, then a design more complex than the 2x2 crossover is needed. 

A general linear model with correlated errors - which is a special case of the 
general linear mixed-effects model described by Laird and Ware [If] - for the paired 
2x2 crossover design with null carryover effects can be applied and is stated as: 
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where: 

• Ysj = {YsjiA,ysjiB,ysj2A,ysj2By is the quadrivariate response for pair j 
(1,2, ...,ns) of sequence s (1 = AB,2 = BA). 

• Xgj is a 4x 8 fixed-effects design matrix (in a complete data situation, Xij and 

do not depend on j, i.e., X^j = Xs for all 5 = 1,2 and j = 1, 2, . . . , rig). 

• (3 = (/xiA, MiB, ^J'2B, Pi, P2, vi, V2)' is an 8 X 1 fixed-effects parameter vec- 
tor containing four type-by-treatment means, two type-by-period effects, and 
two type-by-sequence effects. 

• £sj = {SsjiA, £sjiB, £sj2A, £s]2bY is a random error term. 

The mean of the responses derives directly from the Xsj(3 portion of the model, 
while the variance of the responses derives directly from the Sgj portion of the 
model. 

2.2.1. Mean of the responses 

Our proposed model for the mean of the responses arising from a paired 2x2 
crossover design is a straightforward extension of the model commonly assumed for 
the mean of the responses arising from a basic 2x2 crossover design. That is, we 
assume that the mean of the response in any cell of the paired 2x2 crossover design 
is a function of the subject type and treatment {fJ.ki), as well as the period (pk) 
and sequence {i^i) in which the treatments are taken. More specifically, we propose 
parameterizing the mean of the responses as: 



Type Sequence Period 1 Period 2 



1 


AB 




Ml 


+ yi 


PlB 


- Pi 


+ Vl 


1 


BA 


^J■1B - 


Ml 


- I'l 


PlA 


- pi 


- I/l 


2 


AB 




h/92 


+ 1^2 


P2B 


- P2 


+ 1^2 


2 


BA 


P-2B - 


1- P2 


- V2 


P2A 


- P2 


- V2 



2.2.2. Variance of the responses 

Traditionally, the variance matrix for a crossover design is assumed to be compound 
symmetric. That is, the variance of the responses arising from one treatment is 
assumed to equal the variances of the responses arising from other treatments. And, 
the covariances between the responses of the subjects when on different treatments 
are also assumed to be equal. Like the approach of others (Ekbohm and Melander 
[12], Sheiner [13], ChinchiUi and Esinhart [14] and Putt and Chinchilli [15]), we 
instead model the maximum number of variance components permitted for the 
paired 2x2 crossover design. That is, the variance, S, of the response Ysj, 5=1,2: 

^lAAA (^lA.lB ^1A,2A CriA,2B 

^lAAB CriB,lB ^1B,2A CFiB,2B 

<^1A.2A <^1B.2A <y2A.2A ^2A,2B 

(yiA.2B CriB:2B 02A,2B <^2B,2B 

is much more flexible than the compound symmetric structure traditionally as- 
sumed. 
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2.2.3. The data 

As the paired 2x2 crossover model for the complete data situation suggests, we 
assume that the data are balanced (the occasions of measurement are the same for 
all subjects) and complete (measurements are available at each planned occasion 
for each subject). However, the model is appropriate even when the data are not 
complete but the missingness occurs at random (the probability of response depends 
on the observed data but not the missing data). We now define a pattern-mixture 
model for when the missingness is non-ignorable. 

2.3. A pattern-mixture model 

Attempting to define a model for each of the fifteen missing data patterns yields 
identifiability problems. Some of the patterns will naturally be more populated, 
while others will be sparsely populated at best. For the sake of illustration. Table 
4 summarizes the number of subject pairs in the BARGE Study falling into each 
of the fifteen missing data patterns. Table 4 indicates that there were no non- 
monotonic patterns observed in the BARGE Study, which is not suprising because 
of the longitudinal nature of the trial design. 

In general, because of the inherent identifiability problems, it is necessary to 
collapse the fifteen patterns into coarser groupings, so that information about the 
effects parameters can be "borrowed" across the patterns. When considering po- 
tential groupings, one should take into account the area of scientific research, as 
well as the various characteristics of the subjects and/or pairs that might yield 
group differences. In creating groupings for the BARGE Study data, we propose 
the formation of the following three groups: 

1. A group containing the completers who have a pair match - patterns 0, 10, 
11, and 12. This group is denoted group "C" for "completers." 

2. A group in which all patterns involve missing a second period, regardless of 
whether a pair match exists - patterns 1, 2, 3, 6, 7, 13 and 14. This group is 
denoted group "D" for missingness due to "dropout." 

3. A group of the patterns in which a subject is a completer but does not have 
a pair match - patterns 4, 5, 8 and 9. This group is denoted group "P" for 
missingness due to a missing "pair." 

Collapsing the counts in Table 4 for the BARGE Study according to these criteria 
yields that nc = 29, no = 6, and np = 5. 

It should be emphasized that other groupings are possible besides the one that 
we propose. Some of the other groupings, however, require one to assume that no 
period and no sequence effects exist. That is an assumption that we were unwilling 
to make for the BARGE Study. It should be noted, though, that if there were no 



Table 4 

Frequencies of patterns observed in the BARGE Study 



Pattern 


Count 


Pattern 


Count 


Pattern 


Count 





29 


5 


2 


10 





1 


1 


6 


1 


11 





2 


1 


7 


3 


12 





3 





8 





13 





4 


3 


9 





14 
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period nor sequence effects to worry about, the methods described by Little (f 995) 
could be applied instead. 

Now that our groupings are defined, let: 

• Ygj denote the 4 x f complete data vector for pair j in sequence s. 

• Ypsj denote the r x I observed (reduced) data vector for pair j in pattern p 
and sequence s. 

• = 4^')' denote the 8 x 1 fixed eflfects 
location parameter vector for group g = C,D, and P. 

• Xsj denote the 4x8 design matrix for pair j in sequence s = 1, 2 that links 
the complete data vector Ysj to /J'^'^^ . 

• S denote the 4x4 complete variance-covariance matrix (as defined previously) 
for the complete data vector Ysj (notice that we do not assume that the 
variance-covariance parameters differ across the patterns). 

• Eps be an r X 4 submatrix of the 4x4 identity matrix for pattern p and 
sequence s, in which rows of the identity matrix are removed according to the 
missing values in Ysj ■ The rows of the identity matrix are always considered 
in lA, IB, 2A, 2B order, and r equals the number of non-missing observations. 

Then, we define our pattern-mixture model as: 

Ypsj ~ Eps ^ Nr ^EpsXsj/J^^-', EpsEEps^ , 

where g ~ C ior patterns p = 0, 10, 11, and 12; g ^ D for patterns p = 1,2, 3, 6, 7, 13 
and 14; and g = P for patterns p = 4, 5, 8 and 9. 

For example, pattern p = 2 and sequence s = 1 is missing the IB measurement. 
Therefore, r = 3 and: 

/ 1 \ 
E21 = 1 . 
\ 1 / 

The mean of the response F2ij = (^21^1^, i^2ij2A, ^2y2B)' is: 
and the 3x3 variance-covariance matrix is: 

(^lASA ^\A,2A (yiA.IB 
<^1A,2A <^1A,2A <y2A.1B 
<^1A.2B 0-2A,2B cr2B,2B 

The proposed means and variances of the remaining patterns p and sequences s 
can be obtained similarly. 

2.4- Maximum likelihood estimation 

We can readily obtain maximum likelihood (ML) estimates of the parameters of our 
pattern-mixture model using available software such as PROC MIXED in SAS 9.1. 
The following is sample code from SAS PROC MIXED that can be used to find 
the ML estimates of the covariance parameters and the pattern-specific location 
parameters: 



E21SE22 — 
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PROC MIXED DATA = barge METHOD = ML; 
CLASS pairid position grp; 

MODEL response = mulA(grp) mulB(grp) mu2A(grp) mu2B(grp) 

rhol(grp) rho2(grp) nul(grp) nu2 (grp) /NOINT S; 
REPEATED position / SUBJECT = pairid TYPE = UN; 

RUN; 

The variable pairid represents the identification number of the pairs. The variable 
position represents the order of the data in the pair's quadrivariate response vector: 
lA is position 1, IB is position 2, 2A is position 3 and 2B is position 4. The variable 
grp represents the three groups of patterns: for our example, groups C, D, and P. 

As an alternative to using SAS PROC MIXED, we can use the log-likelihood 
function and a matrix programming language, such as S-Plus or SAS/IML, to write 
our own Fisher scoring or Newton-Raphson algorithm to find the ML estimates. 
Let: 

• Hps denote the number of pairs falling in pattern p of sequence s, and 
denote the number of pairs falling in sequence s. 

• TTps denote the true proportion of pairs falling in pattern p of sequence s. 

• (j)id denote the set of 62 identifiable parameters - 24 effects {fJ-'f^ , ■ • ■ , ^^2^'' ) , 28 
proportions (7roi,7ro2, . ■ . , ttis^i, 7ri3^2), and 10 covariance parameters {aiA,iA, 
• • • , 0-25, 2s)- 

Then, letting H*^ ~ Ep^SEpj, and using Lagrange multipliers Ai and A2 with 
constraints ttqi + ■ • • + ttu^ = 1 and 7ro2 + ■ ■ ■ + 7ri4_2 = 1, respectively, the log 
likelihood function is: 

logLY.n {(t)id) ~ 
' 14 2 

X! X] '^p^ 



- Al (1 - TToi - • • • - 7ri4_l) 
^ li 2 rips 

-A2 (1 -7ro2 ^^14,2) 2 

p=0 s=l j=l 

14 2 "p 



=0 s=l j = l 



where the value of g depends on pattern p [g = C for p = 0, 10, 11, 12; g = D for 
p = 1, 2, 3, 6, 7, 13, 14; and 5 = P for p = 4, 5, 8, 9). 



2.5. Restricted maximum likelihood estimation 



Just as is the case for ML estimation, it is possible to use SAS PROC MIXED to find 
restricted maximum likelihood (REML) estimates of the parameters of the pattern 
mixture model. We only need to make one minor modification to the SAS PROC 
MIXED code previously used to find the ML estimates (change the "METHOD = 
ML" option to "METHOD = REML" option). 

Again, alternatively we can use the restricted likelihood function and a matrix 
programming language, such as S-Plus or SAS/IML, to write one's own Fisher 
scoring or Newton-Raphson algorithm to find the REML estimates. 

Let: 
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• n = (noi, . . . , ni4_2)' be the vector of sample sizes Ups for pattern p and 
sequence s (with N = J^l^o J2l=i 

• TT = (ttqi, . . . , 7ri4.2)' be the vector of true population proportions for pattern 
p and sequence s. 

• Y = (yon, . . . ,yoi«oi> • ■ ■ ,yi4,2i, ■ • • , yi4,2ni4,2)' be the TV x 1 vector of the 
entire set of observed responses. 

• /3 = (/3("^)',/3(^)',/3(^)')' be the 24 x 1 vector of fixed effect location parame- 
ters. 

• X be the x 24 full design matrix linking Y to (3. 

• n be the N x N block-diagonal variance-covariance matrix of Y. 

• 9 he a vector containing the 10 identifiable variance components {(Tia,ia, ■ • ■ , 

0'2B,2S)- 

Then the joint likelihood of Y and n can be written as the factorization: 

^Y.n (^irf) = L^\n {(3, 0) X Ln (tt) , 

where Y|n is a multivariate iV-normal with mean X/3 and variance il, and n is 
multinomial with parameters N and tt. Then, the general procedure behind REML 
estimation of transforming the data vector Y|n such that the likelihood iY|n(/3, 6*) 
factors into two components can be applied, yielding: 

log LY,n {4>^d) - log L' {B) + log L" (/?, 0) + log (tt) . 

LcttingP = rj-i-J7-iX(X'rj-iX)-iX'J7-\ the log REML likelihood function 

1 



IS 



logL' {9) 



and: 



logi" (/3,0) 



-^log 



.log|f2|-ilog|X'f2-ix|-iY'PY 



fX'n-iX)"' - i (X'JI-iX) {j3~p 



where (3 has the form of the generalized least squares (GLS) estimator /3 = 
(X'r2~^X)~^X'r2~^Y. Using Lagrange multipliers, logLn(7r) is the log-likelihood 
of the two independent multinomial samples: 



logLn (tt) 



■ 14 2 

^ 51 ^P" T^ps 
.p=0 s=l 

-A2 (1 - 7ro2 - • • ■ 



- Al (1 - TToi 

7ri4,2) 



7I'l4,l) 



2.6. Inference about 7 



Thus far, we only have addressed estimation of the pattern-specific parameters, 
such as ^i!{a ^ Pi^\ ^'^'i ^1^^ ■ Interest, however, typically lies in inference about the 
overall population parameters, such as /iiA, Pi, and vi, not the pattern-specific 
ones. Therefore, here we extend our work of the previous sections by addressing es- 
timation of the overall population parameters. Furthermore, since use of the paired 
2x2 crossover design will often be motivated by interest in inference about the 
interaction parameter: 



7 = (MIA - Pib) - {P2A - M2b) , 
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we also derive the asymptotic distribution of 7 here. Since the expression for the 
point estimator 7 and the form of the asymptotic distribution of 7 are the same 
regardless of whether REML or ML estimation is used in obtaining 7, for the sake 
of simplicity, we proceed assuming that ML estimation is used. 

In order to define point estimators of the overall population parameters, let TTg 
denote the proportion of pairs in group g for g ~ C, D, and P. Then, for fc = 1, 2 
and I ~ A,B, the weighted average: 

(2.1) Jlki = ncfii^^ +nDl2a^ +np'fl[p 

= ^'c^^ll +T^DlJ.li' + {l-T^c -^^D)^ill' 

is the ML estimator of the overall population treatment-by-genotype mean. And, 
therefore 7 = {IIia ~ Mis) ^ (M2A — V-2b) is the ML estimator of 7. 

The estimator 7 is a function of tt^ and the estimated proportions and 

pattern-specific means. Therefore, we derive the asymptotic distribution of 7 by 
first using the asymptotic normality of ML estimators and then by an application 
of the delta method. 

Let: 

• TT = (ttc, tid)' be the 2x1 vector containing the estimated proportion of pairs 
falling into each of the first two groups. 

• TT = (ttctt/))' be the 2x1 vector containing the corresponding two true 
proportions. 

• V(7f) = Diag{'K) — tttt' denote the 2x2 variance matrix of tt. 

• jl = {jl'iA 1 Mia'' I Mia' j ■ • ■ I V-^TB 1 ^^23 ' ) be the 12x1 vector containing the 
estimated pattern-specific mean parameters. 

• fi ~ {fi'f^ y Mia' 1 Ml a' ' ■ • ■ ' M2B'' ' M2i3'' ' M2B'' ) be the 12x1 vector containing the 
corresponding true pattern-specific mean parameters. 

• V(^) denote the 12x12 variance matrix of /i. 

Then, by the asymptotic normality of maximum likelihood estimators and by 
the independence of tt and jl : 

' ' \ _ f TT 

jlN ) V M 

is asymptotically multivariate normal with mean 0i4xi and 14 x 14 variance- 
covariance matrix: 

V(7f)2x2 02X12 
0l2x2 '^{jl)l2Kl2 



V 



Now, defining the 4x1 vector of functions g{{j:' ,jl')') = {jIia, jliB,jl2A, jl2By where 
the functions jliA,jliB ijl2A and jl2B arc as defined by (2.1), the 4 x 14 Jacobian 
matrix equals J =[Ji|J2] where Ji is the 4x2 matrix: 



/ a^c) _ MP) MD) _ MP) \ 

MlA MlA MlA MlA \ 

-(C) _ -(P) -(D) _ -(P) 

T _ Mis MlB MlB MlB 

•Jl ~ -(C) -(P) -(D) -(P) 

M2A M2A M2A M2A 

-(C) _ -(P) -(D) _ -(P) 

\ M2B M2B M2B M2B / 



268 



L. J. Simon and V. M. Chinchilli 



and J2 is the 4 x 12 matrix: 






V 



7r_D TTp 


















TTp 









71(7 
















TTp 





TTc TTp) 



\ 





TTp y 



Then, an apphcation of the delta method yields that: 





( V-IA \ 






\ 








Mis 






V-2A 








V 


\ M2S ) 




\ M2B ) 





is asymptotically multivariate normal with mean O4X1 and 4x4 variance matrix 
JVJ'. 

Now, we can construct any linear combination of the population mean parame- 
ters as 7 = c'(/ii^, fiiB, f^2A, A'2b)', but the contrast vector of particular interest in 
our application is: 

c = f 1 -1 



-1 1 



Then 



and 



7 = 5 



fJ-lA 



V-IB V'2A T^2B ) ) = c' (/2iA,/ilB,M2yl,/i2B)' 



97 



d ( /iiA V-IB V-2A V-2B 



Therefore, one final application of the delta method yields that v 7V(7— 7) is asymp- 
totically normal with mean and variance V(^) = c'JVJ'c. The resulting asymp- 
totic variance ^(7) depends on rfg, V(7f) and V(/i), and therefore must be 
estimated. 



3. Results 

As described in the Introduction, the BARGE Study was a randomized paired 
2x2 crossover trial in asthma patients comparing the effects of regularly-scheduled 
inhaled albuterol to placebo (Israel et al. [6]). Patients who met the asthma eligi- 
bility criteria were genotyped at the 16th position of the beta-agonist receptor gene 
(Arg/Arg, Gly/Gly, Arg/Gly). The heterozygotes (Arg/Gly) were excluded from 
the study. The Arg/Arg and Gly/Gly patients, on the other hand, were matched 
according to their pulmonary function (as determined by their forced expiratory 
volume in one second) and randomized together to the same treatment sequence. 
The primary outcome variable was the difference in the patients' morning peak ex- 
piratory flow rates (AM PEFR, measured in liters per minute) at the start and end 
of the 16-week treatment periods. A change of 25 liters per minute was the effect 
size considered to be clinically meaningful for the sample size calculation made for 
the BARGE study protocol. 

The BARGE Study consisted of 78 randomized patients, of whom 71 yielded data 
for analysis - 35 with the Arg/Arg genotype and 36 with the Gly/Gly genotype at 
the 16th position of the beta-agonist receptor gene. There were 40 sets of subjects, 
and as mentioned in Section 2.3, nc = 29, na = 6, and np = 5. Because of the small 
samples sizes for groups D and P, which yielded non-estimable standard errors for 
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some of their estimated parameters, these two groups were pooled. Thus, the final 
pattern-mixture model for the analysis of the BARGE Study data consisted of only 
two groups (C and D+P). 

A general linear model with correlated errors, as described in Sections 2.4 and 
2.5, was applied to group C and group D+P separately that included effects for 
period, sequence, and treatment. Because group D+P had a small sample size, a 
common 4x4 variance-covariance matrix was assumed for group C and group D+P. 
The REML estimates for the treatment effects, along with their standard errors, 
appear in Table 5. It is not surprising that the standard errors for the treatment 
effect estimates in group D+P are much larger than those of group C because (1) 
group C consisted of 29 sets of patients with 116 observations, and (2) group D+P 
consisted of 11 sets of patients with 20 observations. The REML estimates for 
the weighted averages of the treatment effects, along with standard errors, appear 
in Table 6, based on the pattern-mixture methodology described in the previous 
section. 

The estimate of 7 = {(J-ra — fJ-Rp) — ifJ-GA — tJ'Gp)^ the genotype x treatment 
interaction term, is -11.7 liters per minute with standard error 22.3 {p = 0.60). 
Thus, the analysis based on the proposed pattern-mixture model does not yield 
a statistically significant result. If the patterns (groups C and D+P) are ignored 
in the analysis, then the estimate of 7 is —15.8 liters per minute with standard 
error 19.4 {p = 0.42). Although the results of the two analyses (with and without 
pattern-mixture modeling) yield similar scientific conclusions, the analysis based 
on the proposed pattern-mixture model is slightly more conservative. 

In the BARGE scientific manuscript, Israel et al. [6] applied a mixed-effects linear 
model to the repeated measurements data within each of the 16-week treatment 
periods. In other words, they used all of the available data, not just the patients' 
final 16-week measurement minus their baseline measurement. Such an approach 



Table 5 

The REML estimates of the treatment effects from the mixed- effects model analysis of groups C 
and D+P separately. Arg/Arg = R, Gly/Gly = G, Albuterol = A, Placebo = P 



Group 


Genotype 


Treatment 


Change in AM PEFR 
= Mean (Std Error) 


C 
C 
C 
C 


Arg/Arg 
Arg/Arg 
Gly/Gly 
Gly/Gly 


Placebo 
Albuterol 

Placebo 
Albuterol 


li^^'jl = 20.4 (11.7) 
M^fj = 8.1 (10.3) 
= 12.6 (17.7) 
p:]^} = 22.3 (20.3) 


D+P 
D+P 
D+P 
D+P 


Arg/Arg 
Arg/Arg 
Gly/Gly 
Gly/Gly 


Placebo 
Albuterol 

Placebo 
Albuterol 


pRP^^ = -23.7 (35.7) 
= 12.0 (40.1) 
^(^+^) = -66.8 (45.5) 
= -46.4 (45.4) 


Table 6 

The REML estimates of the weighted averages of the treatment t 
model analysis. Arg/Arg = R, Gly/Gly = G, Albuterol 


effects from the mixed- effects 
= A, Placebo = P 


Genotype 


Treatment 


Change in AM PEFR 
Mean (Std Error) 


Arg/Arg 
Arg/Arg 
Gly/Gly 
Gly/Gly 




Placebo 
Albuterol 

Placebo 
Albuterol 


'tlpp = 8.3 (13.0) 
TiRA = 9-2 (13.3) 
MGP = -9.2 (17.9) 
MGA = 3.4 (19.3) 
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yielded a more powerful and sensitive analysis than the analysis described here 
because many of the patients in pattern-mixture group D had partial data during 
the 16-week treatment periods which contributed to the overall analysis. Indeed, the 
authors of the BARGE manuscript estimated the genotype x treatment interaction 
term as 24.0 liters per minute with standard error 6.3 {p < 0.01). In a future 
communication, we plan to develop the pattern-mixture version of the mixed-effects 
linear model for the paired 2x2 crossover design that will accommodate repeated 
measurements within the randomized treatment periods. 

4. Discussion 

We have demonstrated the development of a pattern-mixture version of a general 
linear model with correlated errors for the paired 2x2 crossover design. As just 
mentioned, in a future communication we will extend our work described herein to 
account for repeated measurements within the randomized treatment periods. Such 
an approach will optimize the pattern mixture approach because dropouts could 
be redefined as early dropouts or late dropouts. 

A weakness of the pattern mixture approach to the analysis of data from a paired 
2x2 crossover design is that one or more of the pattern groups could represent 
a reasonable proportion of the sample, yet provide little data for analysis. Case in 
point, this occurred with the BARGE Study data presented here. Eleven of the 40 
sets (27.5%) were categorized into group D-fP, yet group D-|-P provided only 20 of 
the 136 observations (14.7%) in the data set. Thus, the influence of group D-l-P on 
the results of the analysis is disproportionate to the actual number of observations 
that it contributes to the analysis. This weakness could be minimized by using 
available repeated measurements within the randomized treatment periods. 

One popular alternative to the pattern-mixture model approach is multiple data 
imputation (e.g., see Schafer [16]). Multiple data imputation involves the construc- 
tion of a complete data set via estimation of the missing values according to an 
appropriate probability model, repeating this m times, estimating the treatment 
effects within each data set, and averaging the effects across the m data sets. Mul- 
tiple data imputation affords two advantages: (1) it provides complete data sets 
that are more easily analyzed via standard statistical methods, and (2) it appropri- 
ately accounts for the variability in estimating the missing values. The disadvantage 
of multiple data imputation is that the probability model for generating imputed 
values could be misspecified. If so, then the combined results could be biased. For 
example, the probability model may be based on the estimated mean and vari- 
ance structure of those experimental units with complete observations. If the data 
from dropouts have an inherently different mean and variance structure, then the 
imputed values could be misrepresentative. 
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